Aggregating Regression Procedures for a Better Performance
Author
Abstract
Methods have been proposed to linearly combine candidate regression procedures to improve estimation accuracy. Applications of these methods in many examples are very successful, pointing to the great potential of combining procedures. A fundamental question regarding combining procedures is: what is the potential gain, and how much does one need to pay for it? A partial answer to this question was obtained by Juditsky and Nemirovski (1996) for the case where a large number of procedures are to be combined. We attempt to give a more general solution. Under an $\ell_1$ constraint on the linear coefficients, we show that for pursuing the best linear combination over $n^{\alpha}$ procedures, in terms of rate of convergence under the squared $L_2$ loss, one can pay a price of order $O(\log n / n^{1-\alpha})$ when $0 < \alpha < 1/2$ and a price of order $O((\log n / n)^{1/2})$ when $1/2 \le \alpha < 1$. These rates cannot be improved or essentially improved in a uniform sense. This result suggests that one should be cautious in pursuing the best linear combination: one may end up paying a high price for nothing when linear combination in fact does not help. We show that, with care in aggregation, the final procedure can automatically avoid paying the high price in such a case and then behaves as well as the best candidate procedure in terms of rate of convergence.
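As a concrete illustration of the setup, the following minimal sketch finds least-squares combination weights over candidate predictions subject to an $\ell_1$ bound on a hold-out sample. The function name and the SLSQP solver choice are assumptions of this sketch, not the paper's procedure.

import numpy as np
from scipy.optimize import minimize

def l1_aggregate(preds, y, l1_bound=1.0):
    # preds: (m, k) hold-out predictions from k candidate procedures
    # y: (m,) hold-out responses
    k = preds.shape[1]
    def risk(w):
        return np.mean((y - preds @ w) ** 2)  # empirical squared L2 loss
    cons = [{"type": "ineq", "fun": lambda w: l1_bound - np.abs(w).sum()}]
    w0 = np.full(k, 1.0 / k)  # uniform weights, feasible for l1_bound >= 1
    return minimize(risk, w0, method="SLSQP", constraints=cons).x

On the same hold-out data one can also compare the combined fit against each individual candidate and fall back to the best single procedure when combining does not help; this is the kind of care in aggregation that lets the final procedure avoid paying the high price when linear combination offers no gain.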
Similar resources
Competitive On-line Linear Regression
We apply a general algorithm for merging prediction strategies (the Aggregating Algorithm) to the problem of linear regression with the square loss; our main assumption is that the response variable is bounded. It turns out that for this particular problem the Aggregating Algorithm resembles, but is slightly different from, the well-known ridge estimation procedure. From general results about th...
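For concreteness, here is a minimal sketch of this type of forecaster, in the spirit of the Aggregating Algorithm specialized to linear regression (often called the Vovk-Azoury-Warmuth predictor); the function name and the regularization parameter a are assumptions of this sketch. The difference from online ridge regression is that the current input x_t is already included in the Gram matrix when y_t is predicted.

import numpy as np

def aa_linear_predictions(X, y, a=1.0):
    # Online square-loss predictions; x_t enters the Gram matrix
    # before y_t is predicted, unlike plain ridge regression.
    d = X.shape[1]
    A = a * np.eye(d)   # regularized Gram matrix
    b = np.zeros(d)     # running sum of y_s * x_s over past rounds
    preds = []
    for x_t, y_t in zip(X, y):
        A += np.outer(x_t, x_t)
        preds.append(float(x_t @ np.linalg.solve(A, b)))
        b += y_t * x_t
    return np.array(preds)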
Aggregated Estimators and Empirical Complexity for Least Square Regression
Numerous empirical results have shown that combining regression procedures can be a very efficient method. This work provides PAC bounds for the L2 generalization error of such methods. The interest of these bounds is twofold. First, they give, for any aggregating procedure, a bound on the expected risk depending on the empirical risk and the empirical complexity measured by the Kullback-Leibler...
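Bounds of this flavour are typically stated for exponentially weighted (Gibbs) aggregates, where the Kullback-Leibler divergence between the aggregation weights and a prior plays the role of the complexity term. A minimal sketch of such an aggregate follows; the function name and the temperature parameter beta are assumptions of this sketch.

import numpy as np

def exp_weights(preds, y, beta=1.0, prior=None):
    # preds: (m, k) predictions of k candidate estimators; y: (m,) responses.
    # Weights each candidate by prior * exp(-beta * m * empirical risk).
    m, k = preds.shape
    risks = np.mean((preds - y[:, None]) ** 2, axis=0)  # empirical L2 risks
    prior = np.full(k, 1.0 / k) if prior is None else prior
    w = prior * np.exp(-beta * m * risks)
    return w / w.sum()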
Bagging Binary and Quantile Predictors for Time Series: Further Issues
Bagging (bootstrap aggregating) is a smoothing method to improve predictive ability in the presence of parameter estimation uncertainty and model uncertainty. In Lee and Yang (2006), we examined how (equal-weighted and BMA-weighted) bagging works for one-step-ahead binary prediction with an asymmetric cost function for time series, where we considered simple cases with particular choices of a...
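A minimal sketch of equal-weighted bagging for a one-step-ahead binary forecast under an asymmetric cost is given below. The i.i.d. bootstrap and the logistic base learner are simplifications of this sketch; a block bootstrap would be the natural choice for time series.

import numpy as np
from sklearn.linear_model import LogisticRegression

def bagged_binary_forecast(X, y, x_next, n_boot=100,
                           cost_fp=1.0, cost_fn=1.0, seed=0):
    # Average bootstrap predicted probabilities, then threshold at the
    # cost ratio: predict 1 iff p > cost_fp / (cost_fp + cost_fn).
    rng = np.random.default_rng(seed)
    n, probs = len(y), []
    for _ in range(n_boot):
        idx = rng.integers(0, n, size=n)    # bootstrap resample
        if np.unique(y[idx]).size < 2:      # skip degenerate resamples
            continue
        clf = LogisticRegression().fit(X[idx], y[idx])
        probs.append(clf.predict_proba(x_next.reshape(1, -1))[0, 1])
    p = float(np.mean(probs))
    return int(p > cost_fp / (cost_fp + cost_fn))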
SVM Aggregating Intelligence: SVM, SVM Ensemble, SVM Classification Tree, and Evolving SVM Classification Tree
This article frames the concept of SVM aggregating intelligence as research on three levels: aggregating for better machine-learning performance, aggregating for an adaptive/dynamic intelligent system, and aggregating for multitask and life-long continuous machine learning. It also reviews existing SVM aggregating methods, including the SVM ensemble, the SVM classification tree, and the evolving SVM classification tree.
Precise Wind Power Prediction with SVM Ensemble Regression
In this work, we propose the use of support vector regression ensembles for wind power prediction. Ensemble methods often yield better classification and regression accuracy than classical machine learning algorithms and reduce the computational cost. In the field of wind power generation, the integration into the smart grid is only possible with a precise forecast computed in a reasonable time...
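A minimal SVR-ensemble regressor is easy to express in recent scikit-learn. This is an assumption-laden stand-in: synthetic data replaces the wind-power features, and none of the paper's feature engineering or tuning is reproduced.

import numpy as np
from sklearn.svm import SVR
from sklearn.ensemble import BaggingRegressor

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))                        # stand-in features
y = X[:, 0] ** 2 + rng.normal(scale=0.1, size=200)   # stand-in power output
model = BaggingRegressor(estimator=SVR(kernel="rbf", C=10.0),
                         n_estimators=25, random_state=0).fit(X, y)
y_hat = model.predict(X[:5])                         # ensemble-averaged forecast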
Journal:
Volume/Issue:
Pages: -
Publication year: 1999